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Abstract 



^■^ ^ • We establish a multivariate empirical process central limit theorem for stationary 

p ^ I valued stochastic processes {Xi)i>i under very weak conditions concerning the dependence 

structure of the process. As an application we can prove the empirical process CLT for 
ergodic torus automorphisms. Our results also apply to Markov chains and dynamical 
■ systems having a spectral gap on some Banach space of functions. Our proof uses a 

multivariate extension of the techniques introduced by Dehling, Durieu & Volny (2009) 
in the univariate case. As an important technical ingredient, we prove a 2p-th moment 
bound for partial sums in multiple mixing systems. 
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O ; 1 Introduction and Statement of Main Results 

O 

Let {Xi)i>i be an M'^- valued stationary stochastic process with multivariate marginal distri- 
bution function F[t) = P{Xi < t), t G W^. We define the empirical distribution function and 
^ ' the empirical process by 

1 

Fnit) := -#{1 <i<n:X,<t} 
n 

Unit) := V^{Fn{t)-F{t)), 

t e Mf^. Here "<" denotes the coordinate-wise ordering, i.e. {ti, . . . ,td) < (si, . . . , Sd) if and 
only if ti < Si for all i G {1, . . . , d}. In this paper, we study weak convergence of the empirical 
process towards a Gaussian process in the space -D([— oo, oo]'^). We make very weak assump- 
tions concerning the dependence structure of the underlying process (Xj)j>i. Effectively, we 
require a multiple mixing condition, the central limit theorem for partial sums of a restricted 
class of functions and a condition on the modulus of continuity of the distribution function 
F. Our results apply to Markov chains and dynamical systems whose transfer operator has 
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a spectral gap on some Banach space of functions. As most significant application, we can 
establish the multivariate empirical process CLT for non-hyperbolic crgodic torus automor- 
phisms, a system that does not have a spectral gap on common spaces of functions. 

The study of empirical processes was initiated by Donsker's empirical process invariance 
principle (see Donsker 1952), which covered the case of i.i.d. R- valued observations. Donsker's 
original theorem has been generalized to dependent variables by a number of authors, starting 
with work by Billingsley (1968) who could establish the empirical process invariance principle 
for functionals of uniformly mixing processes. Billingsley applied this result in his inves- 
tigations of statistical properties of the continued fraction expansion. Berkes and Philipp 
(1977/78) were able to treat the empirical process of strongly mixing sequences. Borovkova, 
Burton and Dchling (2001) could treat functionals of absolutely regular processes. Dehling 
and Taqqu (1989) proved an empirical process invariance principle for long-range dependent 
data. Dedecker and Prieur (2007) proved empirical process invariance principles for processes 
that satisfy one of the weak dependence conditions introduced earlier by the same authors, 
see Dedecker and Prieur (2005). These are generalizations of classical mixing coefficients, in 
a different way than Doukhan and Louhichi (1999); see also the recent book by Dedecker et 
al (2007) . Wu (2008) studied empirical processes in the case that the underlying process can 
be represented as a functional of an i.i.d. process (ej)jg2, i.e. Xi = f{{^i-k)k>o)- Wu and 
Shao (2004) investigated the empirical process for certain classes of Markov chains. 

In recent years, a lot of research has been devoted to the study of statistical proper- 
ties of data arising from dynamical systems. Given a measure preserving dynamical system 
(fi, J-,T, P), consider the process X„ := T(X„_i), n > 1. When T is a uniformly expand- 
ing map of the unit interval, this process can be represented as a functional of an absolutely 
regular process; see Hofbauer and Keller (1982). Denker and Keller (1986) used this represen- 
tation in their investigations of the asymptotic behavior of [/-statistics when the underlying 
data arise from a dynamical system. By combining this representation with coupling ideas, 
Borovkova, Burton and Dehling (2001) were able to study the empirical process and more 
generally, [/-processes. 

The spectral gap technique is a very powerful technique that allows to study much larger 
classes of dynamical systems. Let Q denote the Pcrron-Frobenius operator, defined by J / o 
TgdP = j fQgdP, where / G L^, g E Li. The spectral gap technique studies the dynamical 
system via spectral properties of the Perron-Probenius operator, viewed as operator on a 
suitable invariant subspace of Li. The spectral gap technique has been very successfully 
applied to the study of central limit theorems and large deviations properties; a survey and 
a large number of examples can be found in the monograph by Hennion and Herve (2001). 
It is possible to treat empirical processes within the framework of the spectral gap technique 
if the Perron-Probenius operator has a spectral gap on the space of functions of bounded 
variation, see Collet, Martinez and Schmitt (2004). In this case one can directly apply the 
standard proof of the empirical process invariance principle, i.e. establish finite dimensional 
convergence and tightness. This is essentially due to the fact that the indicator functions are 
functions of bounded variation. The situation is different when the spectral gap property can 
only be established on a smaller space of functions, such as Lipschitz functions. Recently, 
Gouezel (2009) has given a one-dimensional example of such a dynamical system. 

Dehling, Durieu and Volny (2009) have developed a new technique that is particularly 
useful when handling data from dynamical systems whose Perron-Probenius operator has a 
spectral gap in the space of Lipschitz functions. Instead of trying to deduce inequalities for 
bounded variation functions from those on Lipschitz functions, the authors proposed a proof 
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which only involves Lipschitz functions. In principle, this technique can also be applied to 
other spaces of functions, not just Lipschitz functions. The technique can also be applied 
to the study of Markov processes whose Markov operator has a spectral gap in the space of 
Lipschitz functions. The new technique uses classical chaining ideas, but replaces the indicator 
functions that are commonly used by Lipschitz functions. Dehling, Durieu and Volny (2009) 
make two assumptions concerning the process (Xj)j>o. For any Lipschitz functions / : M — )• M, 
the partial sums X]r=i fi-^i) satisfy the central limit theorem and a suitable bound on the 
4-th central moments. Under these two assumptions and a mild additional assumption on 
the modulus of continuity of the distribution function F, Dehling, Durieu and Volny (2009) 
could establish the empirical process invariance principle. 

In the present paper, we extend the techniques of Dehling, Durieu and Volny (2009) to 
multidimensional systems satisfying a multiple mixing condition. In the multidimensional 
case, the 4th moment bounds have to be replaced by bounds on higher order moments. We 
establish such a bound for multiple mixing systems. The multiple mixing condition has been 
used in the study of the statistical properties of dynamical systems, e.g. in the work of Le 
Borgne (1999) and Durieu and Jouan (2008) on non-hyperbolic torus automorphisms. 

Definition. Let {B, \\ ||) be a Banach space of measurable functions ip : M'^ — > M. We say that 
the process {Xn)n>o has a multiple mixing property with respect to B if there exist constants 
< 9 < 1 and r > 1 such that for any p € N \ {0}, there exist a positive constant C and an 
integer i such that the following assertions hold: for any ii,. . . ,ip G N and q £ {1, . . . ,p}, 
for any (p in B such that {ip) = and \\p\\oo < 1; 

|Cov((^(Xo)¥p(X,*) . . . (^(X,._ J, ifiX,,). . . ^{Xi*))\ < C||(^(Xo)||.||</.||^P(ii, . . . , v)0^^ (1) 

where P is a polynomial function of p variables which does not depend on ip and where we 
use the following notation: if (a„)„>i is a sequence of real number, a* is the sum (^i- 

In this paper, we will work mostly with the Banach space of bounded a-H61der con- 
tinuous functions, < a < 1. We define the norm 

\9{x) -g{y)\ 

llffll = llfflloo + sup — I _ 1^ • (2) 

x^yeRd F y\ 

Theorem 1. Let (Xj)j>i be an W^-valued stationary stochastic process satisfying the multiple 
mixing property with respect to the Banach space . Assume that for all 4> £ T~i°' , the partial 
sums X]r=i '^("'^*) satisfy the central limit theorem, i.e. that 



1 " 

-= V(0(Xi) - E4>{x, 



where a"^ = Var((/)(Xi)) -|- 2 Cov(0(Xj), 0(Xi)). // the modulus of continuity u of the 
distribution function F satisfies the condition 

oj{5) = 0(1 \og{5)\~"') for some 7 > r, (3) 

where r is given by (0]), then the empirical process central limit theorem holds, i.e. 

— oo,ooJ" 
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where " — >" denotes the weak convergence in the Skorohod space D{[—oc,oo]'^). 
Here (W{t))^^^_^ is a mean zero Gaussian process with covariance structure 

EWis)-W{t) = Cov(l(_^,,](Xo),l(_„,,i](Xo)) 

oo 

k=l 

oo 

+ Cov(l(_o^,,] (Xfc), (Xo)). 
k=l 

Further, almost surely, (W^(i))te[-oo,oo]'* ^'^•^ continuous sample paths. 

Notice that the assumption on the modulus of continuity of F plays a key role in the proof 
of the theorem. It allows us to control the indicator functions of the process from the control 
on the Holder observables. As mentioned by the referee of this paper, there are good reasons 
to believe that condition ([3]) is necessary if one only has assumptions on Holder functions. 

The proof of Theorem [T] will be given in two parts. In Section[2]we will establish a general 
empirical process CLT under the conditions of Theorem 1, but with the multiple mixing 
replaced by a 2p-th moment bound. In Section [3] we will show that multiple mixing implies 
this 2p-th moment bound. 

As an application of Theorem [T] we can establish the empirical process invariance principle 
for ergodic torus automorphism. We consider the torus T*^, identified to [0, 1]'^ and equipped 
with the Lebesgue measure, and define the automorphism T : T*^ — ?■ T"^ by 

T[x) = M X mod 1. 

Here M is a d x d matrix with integer entries and |detM| = 1. We assume that the ma- 
trix M has no eigenvalue which is a root of unity which is equivalent to ergodicity. Such 
torus automorphisms always have at least one eigenvalue of modulus strictly bigger than 1 
and then another one of modulus strictly smaller than 1. Thus a part of the action of the 
automorphism on the torus has some hyperbolicity. The automorphism is called hyperbolic 
if it has no eigenvalue of modulus one, and quasi-hyperbolic if it has eigenvalues of modulus 
one. For more details on torus automorphisms see Lind (1982). Le Borgne (1999) estab- 
lished the central limit theorem for quasi-hyperbolic torus automorphisms. Durieu and Jouan 
(2008) proved the empirical process central limit theorem for certain univariate functionals 
of quasi-hyperbolic torus automorphisms, i.e. they considered the empirical distribution of 
the sequence {f{T^ x))k>i, for / : T — >• M. In this paper we can establish the full empirical 
process invariance principle in the same case. 

Theorem 2. Let T he an ergodic automorphism of the d- dimensional torus. Then the em- 
pirical process 

d 

Unit) = V^iFn{t)-'[[ti), t = [t^, . . . ,td) G [0,1]'^ 
i=l 

converges in distribution to a centered Gaussian process (I^t)te[o,i]d which has almost surely 
continuous sample paths. 
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2 An invariance principle for the multivariate empirical pro- 
cess 

Let (-'^n)„>o be a stationary process with values in W^, d > 1. For i = (ti, . . . , t^) G M'^ and 
s = (si, . . . , Sd) G M'^, we use the notations 

t + S={tl+Sl,...,td + Sd), 

s < t 4^ Si < ti for all i = 1, d, 
s < t 4^ Si < ti for all i = 1, . . . ,d 

and for s < t, 

d 

[S,t] = '[\[Si,ti]. 

i=l 

As mentionned in the introduction, to generahze the result of Dehling et al. (2009) to 
multivariate processes, we need higher moment bounds. In this section, we will assume 
that the process {Xn)n>Q satisfies moment bounds on a given Banach space. The technique 
which is developed here is useful in cases where the Banach space does not contain the 
indicators functions. Our technique will work if the Banach space B is enough well adapted 
to approximate indicator functions. Typically, we could work with space of regular functions 
as the spaces of Lipschitz continuous functions, Holder continuous functions, or C'^ functions. 
Here, to have a link with the applications, we will work with the space H"^ of bounded a- 
Holder functions, for some fixed a E (0, 1]. This space is equipped with the norm ||.|| = 
defined in ([2]) 

Then, we make two assumptions concerning the process (Xj)j>o, 

1. For any function / G 'H°', the CLT holds, i.e. Yl'iLoCov{f{XQ), f{Xi)) converges and 

n 

Y^ifiXi) - Ef{Xi)) A iV(0, a% (4) 
where A^(0, cr^) denotes a normal law with mean zero and variance 

oo 

= E{f{Xo) - £;/(Xo))2 + 2^Cov(/(Xo),/(X,)). 

i=l 

2. For any p > 1, a bound on the 2p central moments of partial sums of (/(Xj))j>o, / G 'H°' 
with E{f{Xo)) = and ||/||oo < 1, of the type 

<Kj2nlf{Xo)tlog'^-\\\f\\+a) (5) 
1=1 

where K is some universal constant, a > 1 and r > 1. 
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In Section [3l we will show how this condition is implied by the multiple mixing property. 

Recall that the empirical distribution function t G M'^) and the empirical process 

[Unit], t G M'^) are defined by 

1 " 

Fn{t) = - Vl(_oo,t](^.), t e< 

1=1 

Unit) = V^iFn{t)-F{t)), teM^ 

where F is the distribution function of Xq and the modulus of continuity of a function 
f :M.d — ^^^y 

L0f{6) = sup [\f{s)-f{t)\ : s,tGM^|s-t| <(5}. 

Theorem 3. Let (Xj)j>o be an -valued stationary random process such that the conditions 
and ^ hold. Assume that Xq has a distribution function F satisfying the following 
condition, 

ojf{5) = 0{\ log(5)|~''') for some 7 > r, (6) 
where r is given by Then 

{Un{t))t(i[- 00, oo\'i ^ (^(O)te[-oo,oo]'^) 

where W{t) is a mean-zero Gaussian process with covariances 

EW{s)-W{t) = Cov(l(_^,,](Xo),l(_oo,t](^o)) 

00 

+ J2 Cov(l(_oo,s] (^0), l(-oo,t] (Xk)) 
k=l 
00 

+ J2 Cov(l(_^,,] (Xk), l(_oo,t] (^o))- 
k=l 

Further, almost surely, {W{t))^^^d has continuous sample paths. 

Remark 1. Here, if ^ holds for r = 1 as in Dehling et al.(2009), then in assumption ^ we 
can consider every 7 > 1 . This is an improvement of the corresponding theorem for dimension 
1 of Dehling et al.(2009). This is a consequence of the fact that we consider 2p-th moment 
inequalities (p > 1) instead of only a A-th moment bound. 

In this paper, we work with the Skorohod topology on the function space D{[— 00,00]'^), 
as introduced by Neuhaus (1971) and Straf (1972). In fact, these authors considered the 
space D([0, 1]*^), but we can easily extend their definitions since [0, 1]'^ and [—00,00]'^ are 
homeomorphic. Take any homeomorphism <j) : [—00,00] — )• [0, 1], e.g. 

1 1 

(f>{t) = — I arctan(t), 

2 vr 

and define <I> : [-00, oo]'^ —?■ [0, l]'' by 

$(ti,...,td) = ((/>(ti),...,(/>(td)). 
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The map $ induces a map that associates to any function / G D{[0, 1]'^) the function / o $ : 
[—00,00]'^ — )• R. We define D{[—oo,ooY) as the image of Z?([0, 1]*^) under this map. Neuhaus 
(1971) and Straf (1972) introduced a metric do on -D([0, l]'^) that generates the Skorohod 
topology and such that {D{[0, l^), do)) is a complete separable metric space. We can naturally 
extend do to D{[— 00,00]'^) by defining for 31,(72 G -D([— 00, 00]'^) 

doi9i,92) = do{gi o ^>"\s'2 o ^"^)- 

In what follows, we will denote the metric on D([— 00, cxo]'^) also by do- Note that do is 
bounded by the supremum distance, i.e. 

4(51,52) < sup \gi{t) - g2{t)\, 

00,00]'^ 

and that (!?([— 00, oo]'^), do) is a complete separable metric space. Note that the sample paths 
of the processes arising in this paper are elements of the function space D([— 00, 00]'^), since 
their limits as any of the arguments approach 00 exist. Alternatively one can see this by 
observing that e.g. the empirical distribution function Fn of the process (Xj)j>i is the image 
under the map defined above of the empirical distribution function of the process (<I>(Xj))j>i. 

Proof of Theorem 

To prove Theorem [3l we shall adapt the technique introduced by Dehling et al. (2009). 
The idea is to replace the indicator functions l(^_^^tj{x) by approximations in the space Ti". 

For each i = 1, . . . ,d, we denote by Fi the marginal distribution functions of Xo corre- 
sponding to the i-th coordinate. Note that the Fi also verify condition ([6]). 
Given a partition of [0, 1], 

= ro < . . . < rm = I 

we define 

U,j=Fr\r,) 

where F~^ is given by 

F~'^{t) = sup{s G M : Fi{s) < t}. 
Thus, by continuity of the Fi, we have subdivisions 

-00 < tj_o < • • • < ii,m = +00. 

If j = Ui, • • • ,id) e {0, . . . mY, we set 

~ • • • ' ^d,ja)- 

We introduce the functions ipj : M'^ — M, j = (ji, . . . ,jd) G {1, . . . m}'^ defined by 

iJ[ \\ti,n-l - / 

otherwise 
with 

(p{x) = l(_oo -1] (x) - 2;l(_i,o] (x) (7) 



ipj{x) = < 
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and where we eventually used the convention that ^ = 0. 

Note that is a a-Holder function on M (for all a € (0, 1]). The function ipj will serve as a 
"H^-approximation to the indicator function l(-oo,tj_i]- 

Now, we introduce the process 

1 " 

i=l je{l,...m}'i 

= E (^E^.(^^))ife-.*.)W- 

je{l,...m}'* V i=l / 

Note that Fir\t) is a piecewise constant approximation to the empirical distribution function 
Fn{t). For t £ [tj^i,tj), we have the inequality 

Fnitj-2)<FtHt)<Fnitj.i). 

We define further 

m 

and finally the centered and normalized process 

C/^'") (t) = ^ (^F^H (t) - (f)) , t e M^. (8) 

Theorem [3] will follow by application of the following Theorem which is proved in Dehling et 
al. (2009). 

Theorem. Let {S,p) be a complete separable metric space and let X^, xii^^ and X^"^\ 
n,m > 1 be S-valued random variables satisfying 

X^™) A asn^ oo,Vm (9) 

lim limsupP(/)(X„,X^™)) > e) = 0,Ve > 0. (10) 

Then there exists an S-valued random variable X such that 

A„ — > X as n ^ oo. 

Here we work in the complete separable metric space {D{[— 00,00]'^), do). We shall prove 
separately that Q and (jlOp hold for the !)([— cxd, oo]'^)-valued random variables in 
Proposition [1] and Proposition [2j 

Proposition 1. For any partition = tq < . . . < = 1, there exists a piecewise constant 
Gaussian process (VF(™)(t))^g^, such that 
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The sample paths of the processes {W^"^\t)^^^^j^ are constant on each of the rectangles 

[tj-i,tj), 1 < j < m, and VF(™)(0) = 0. The vector {W'^'^\ti), . . . ,W^"'\tm)) has a mul- 
tivariate normal distribution with mean zero and covariances 

Cov(W^('")(ti_i),VF("^)(t,-_i)) = Cov(vPi(Xo),v?j(Xo)) 

oo 

k=l 
oo 

+ J]]Cov(v7i(Xfc),^j(Xo)) 
fc=i 

Proof. 

Use (jl]) and the Cramer- Wold device. □ 
Proposition 2. For any e, > there exists a partition = rg < . . . < = 1 such that 



limsupP sup 



Un{t)-Ut\t) 



> e ] <r]. 



Proof. 

Prom here, we assume the partition = rg < . . . < = 1 is a regular partition of step 
h = m~^. Let j = 1,. . . ,m. On the interval [rj_i,rj] we introduce a sequence of refining 
partitions 

(fc) ^ (fc) / / (k) 

by 

For each i G {1, . . . , d}, let us define 

^, = F^'i^) > o<^<2^ 

We now have partitions of tjj], 



_ (fc) (fc) ^ (fc) _ ^ 



For convenience, for j > 1, we also consider the points 

and for j < m, we consider the points 

For any t G [tjj-i, tj^j) and A; > we define the index 

kj{k,t) = maxj/ : s-^-^^ < t| . 
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Now fix j G {1, . . . , m}'^ (then the index related to j will be forgotten). For / = (/i, . . . ,1^) 
and t E we write 

Sk) _ ((k) (k) . 



and 



l{k,t) = {lij^{k,ti),. . . ,ld,jai^,td)). 
In this way we obtain a chain, 

^ _^(o) <^(i) < <i<5W 



hnking the point tj-i to t. Note that for t G we have by definition U^\t) = 

We define the functions '4^\^\ k > 0, I ^ {0, . . . , 2^^ + l}'^, in the following way : We first 
define, for i = 1, . . . , d and / e {0, . . . , 2^" + 1}, 



r 

1 



(k) 



Jk) _ (k) I 
i,ji,l i,ji,l-V 



if ji = 1 and I = 
if ji = m and / > 2^ 

otherwise 



where tp is defined as in ([7]) (eventually, we use ^ = 0). Then we set, for / = (/i, . . . , l^) G 
{0,...,2'= + l}^ 



d 



1=1 

M J,(k) ./,{0) 



Observe that by definition of s^^^ ^-^ and of V';(o t)(^) ~ ^ji^) ^^'^ 



In this way we get 

Fr,it)-FtHt) 



k=l i=l 
1 " 



(11) 



where iiT is some integer to be chosen later. 

From (jlip we get by centering and normalization 



k=l i=l 



+4E{(Wi(^0-^(0) 
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For the last term on the r.h.s. we have the following upper and lower bounds, 

* 1=1 

i=l 



and 



Now choose K = A + 



i=l 

>~^{F{t)-Ei^\^l^^{Xi)). 
log (d^^) log-i(2) 



and note that 



We thus have 



< 



Therefore, since 
we get for all t G [tj-i,tj], 

K n 

u^{t)-ut\t)\ < Et^ E{«m)(^^)-Mm)(^^)) 

fe=l ^ 1=1 

n 

E{Kw(^o-M^!t)+2(^o) 



+ 



e 

+ 2- 



Note that by definition of l{k,t) and of sp'', we have 
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where the integer part [.J is taken on each coordinate. We infer 

K 



sup 



k=l ^ 



max 
ie{o,...,2''}'* 



i=l 



-\ — — max 

^/n «e{o,...,2-ff}rf 



i=l 



e 

+ 2- 



Now, taking 



4fc(fe+l) 



we obtain 



sup 


Un{t)-Ut\t) 




W,_i<i<i, 







K 



-(<,r(x<)-£W"'™)}|>|)- 

At this point we shah use Markov's inequahty at the order 2p together with the 2p-th moment 
bound ([5]) for an integer p such that 



p > d- 



r7 
7 — r 



(12) 



First, remark that these following bounds also hold. Since ih-Z—ih^ , vanishes outside 
[s^'^ ^] , s[^} ; 1, we have 



lik-l). 



< 



E 



(fe-i) 



< (imax|F,(sg_^; 
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and in the same way 



v^j5(xo)-vr^(^o) 



/3dh 

- VW 



Now by ([6]), if A; is big enough, we have 



(fc) 



1 



< l + d 

< l + d 



inf <{ s > : 3i e {1, . . . ,d}yt, Fi{t + s) - F,{t) > ^ 



l + d exp Q 



inf <^ s > : i:>|log(s)r'^ > 



D2''\ 'I 



2k 



h 



Thus, there is a positive constant B such that for arbitrary k > 0, 

/ . 1 \ 



(fc) 



< i?exp a 



h 



Therefore, applying successively Markov's inequality at the order 2p, the 2p-th moment 
bound dS]) and the preceding inequalities, we get 



P sup \Un{t)-Unitj)\>e 



,tj — l^t^tj 



< 



< 



-nM -p log^P M a + 5exp a 



e2p n'P \ 



p K 
i=l fc=l 

k=l 



h 



2p-i 



2k ^ 7 



< 



p-1 



i=l £ T 



fc=i 



where C always denotes a positive constant, but its value changes from line to line. 

By Condition p2p . the series converges and there exists an ^ > such 



that for alH G {1, . . . ,p — 1}, 



k=i 



i 2p — i 

d- - + — 2{p-t) < -A. 

r 7 
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Finally, using mh = 1, we have 



P I sup 

V0<i<l 



Un{t)-Ut\t) 



> £ 



< E ^ 

je{l,...,m}'' 

1 



sup 

t,-l<t<tj 



< C{p-2)- 



e 



Ap+d 



Unit)-ujr\t) 

4 + logfd^ 



4p+l 



+ c- 



hr 7 



t2p 



The first summand converges to zero as n — )• oo and, since ^ — d — ^ > 0, the second can be 
made arbitrarily small by choosing a partition that is fine enough (i.e. h small). □ 

Remark 2. The tightness of the empirical process can be proved using exactly the same proof 
than the one of Proposition 2.3 in Dehling et al. (2009). The almost sure continuity of the 
limit process follows. 



Remark 3. Assume that conditions ^ and (Ejj hold for a space of -functions instead of a 
space of Holder continuous functions. It is clear that the same technique works (in the proof, 
take for example ip{x) = l(_oo,-i](^) ~ sin (vrx + |) l(_i^o] (^)/'- 



3 Moment bounds for partial sums 

For a function ip:R'^ — ^ M, we consider the partial sum 



n-l 



i=0 

Multiple mixing properties allow us to obtain some useful moment inequalities. In this 
section, we show how the multiple mixing property implies the 2p-th moment bound which 
is required in Theorem [3j 

Theorem 4. Let {Xn)^yQ be a stationary process having a multiple mixing property on B 
and if & B such that (if) = and sup^ ^ 1- Then for all p > 1, 



Sn {iff"] <kY, n'\W (Xo) t log'^'-WM + e-'] 



i=l 



and 



i=l 



where K is a constant which does not depend on n or if and r > 1 is given by 
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Proof of Theorem 14]. Let us consider the assumptions of Theorem H] hold and let <p €z B 
with E (if) = and sup^ Iv'C^;)! < 1 be fixed. We use the notation a* = Y17=i '^i- 

Notations: For all p > 1, we define 

Inip)= Yl m^{XoMX,,)...ipiX,;))\ 

< ii, . . . ,ip < n — 1 
il <n- 1 

and/„(0) = |E(vp(Xo))| =0. 

As the process is stationary, for p > 1, we have 

\E[Sn{'pn<p\nIn{p-l). 

So, to prove Theorem HI it is sufficient to prove the following lemma. 
Lemma 1. For all p > 1, 

U2p - 1) < Kj2^'~'\\v (^o) lit log'p-'m + 0-') 

and 

In{2p) < KY,n'-^\W{X,) ||;iog2^'-+nilv=ll +^"') 

i=l 

where K is a constant which does not depend on n or if. 
Notations: For all p > 1 and q £ {1, . . . ,p}, we define 

n-l 

Up, q) = Yl E |E(v.(Xo)(^(X,*) . . . v{X,*))\. 

*''=° < ii, . . .,iq-l,iq+l, ■■■,ip<iq 



ip < n — 1 



We have 



In{p) < YJn{p,q)- 

9=1 

To prove Lemma [H we will use the following lemma. 
Lemma 2. For all p £ N* and q G {1, . . . ,p}, 

Jn(.P,q) < CMXo)\\rlogP{M\+e-^)+nIniq-l)In{p-q), 
where C is a constant which does not depend on n or if. 

Proof of Lemma [H 

Let no be a positive integer such that 

log(||y^||+0 ^ ^ log(||y||+g-i) ^ ^ 
— log 9 ~ — log 
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We thus have the inequahty ^^^^Hc/jH < 1 and no > 2. 
We have 

n-l 

Jn{p, q)<Yl Yl [^h,-,ip + Bn,...,i,] 

*9=° < . . .,ig-i,ig+i, ...,ip<iq 
ip <n — 1 

where 

Ai,,...,^ = |Cov(v^(Xo)v?(X,*)...v^(X,*_J,v9(X,.)v.(X,*_^J...(p(X,;)| 

and 

Using Holder inequahty for = to uqI — 2 and multiple mixing property ([T|) for ig > uqI 
(where i comes from ([T])), we obtain 

n-l 

*9=° < ii, . . . ...,ip<ig 

i*<n-l 

no£—2 n 

<c ^ (i, + i)p-i||^(Xori|ii+c {^, + lr-'4v{Xo)\\r9'^y\fQ{^) 

iq=0 iq=no£—l 

oo 

<C{no-irMXo)\\r + CKMXo)\\r ^ (i, + If" ^^''"""^QCg 
where Q{ig) = Ylo<u ...i <i Pih,---,ip) is a polynomial function of ig. Then 



.iO<ii,...,iq-l,iq+i,...,ip<i 



Y2T=o9^''Qih) converges and we deduce that 



^ (ig + ir-'e'^-'^^'Qniig) < Cim - If-' 
iq=noi—l 

where C is independent of uq. Thus, since uq — 1 < Clog{\\ip\\ + 0~^), 

n-l 

E E < CKMXo)\\rlogP{M+^ 

'9=0 < ii, . . . ...,ip<ig 



On the other hand, 



< zi, . . . ■ ■ ■ ,ip<iq 

ip<n—l 

Therefore, 



Jn(p, g) < CKyiXo)\\r log^'(||(^|| + ^ ^) + n/„(g - - q). 



16 



□ 

Proof of Lemma [7J 

We proceed by induction. We have, = Jn(l) !)• Then, by Lemma[21 

I„(l)<C||v9(Xo)||.log(||(^||+ri). 
In the same way, /n(2) < Jn(2, 1) + Jn(2, 2). Then, by Lemma [21 

ini2)<cMXo)\\riog'{\M+e-'). 

In the general case, by Lemma [H 
p 

In{p) < ^Jn{P,q) 
q=l 

p-l 

< CpMx^)\\r \ogP{M\ + e-^) + nY, Uq - ^)Up - q)- 

q=2 

Studying YTq=a ^n{q — ^)In{p — q) according to the parity of p, we deduce the inequahties of 
Lemma [TJ □ 

4 Markov chains and dynamical systems with a spectral gap 

Let {Xn)j^yQ be a homogeneous Markov chain with a stationary measure z^. Denote by P 
the associated Markov operator and E the state space. Consider a Banach algebra {B, ||.||) 
of z/-measurable functions from E to M, which contains the function 1 = 1e and which is 
continuously included in (v) , \\.\\s) for some s £ [l,+oo], 
i.e. 3C > such that V/ € B, 

<C\\f\\. (13) 



We say that the Markov chain (^n)„>o is B- geometrically ergodic or strongly ergodic (with 
respect to B) if there exist k > and < 9 < 1 such that for all / G ^B, 

IIP"/ - n/ii 11/11 (14) 

where Uf = (/) 1. 

Strong ergodicity corresponds to the fact that the Markov transition operator acting on B 
has 1 as simple eigenvalue and the rest of the spectrum is included in a closed ball of radius 
strictly smaller than 1 (see Hennion & Herve 2001). 

Lemma 3. Let {Xn)n>o ^ B- geometrically ergodic Markov chain, then it satisfies the 
multiple mixing property ^on B with r = (r = 1 if s = oo). 

Proof of Lemma [3 
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Let Fi be the cr-algebra generated by the Xj, j < i and let if belongs to B such that 
(99) = and \\(p\\oo < 1- Using the operator properties, we have 

|Cov(v.(Xo)^(X,.) . . . . . . ip{X,*)\ 

= \E [p{XoMX,^^). . . ipiX,*J [Eiip{X,*)E{fiX,*J . . . E{f{X,*)\T,;J . . . J 

-E{f{X,,)ip{X,,^J...ipiX,,)) 

<MXoMX,^)...ip{X,*_J\\r 

||P^^((^P^^+l(<^ . . . P'^ip)){Xo) - 7T{cpP'^+' (99 . . . P^^V9))(Xo)||. 

< c^ifiXoyWre'-^yp'-^+'if . . . p*-<^)||. 

Further, 

and the result follows. □ 
The corresponding result holds in the setting of dynamical systems. Let {il.,A,fi) be a 
probability space and T a measurable measure preserving transformation. Let us consider 
the Perron- Frobenius operator (or the transfer operator) of T, P : >C^(/u) — > >C^(/u) defined 

by 

Pf{x)g{x)dfi{x)= / f{x)goT{x)dfi{x) 



for all / e C^in) and g E 

We assume there exists a Banach algebra {B, ||.||) of /x-measurable functions from Q to 
which contains 1 and satisfies (|13p and that P verifies: 
there exist k > and < 9 < 1 such that for all / G 



||p"/-n/||<Kri 

where n/ = E^(/)l. 

Lemma 4. Let f £ B and Xi = f o T*, i < 0. Then (Xj)j<o satisfies the multiple mixing 
property U^) on B with r = (r = 1 if s = 00). 

For both setting, if the space B is enough well adapted to approximate indicator functions, 
then Theorem [3] applies. Examples of Section 4 in Dehling et al. (2009) can be generalized in 
higher dimensions. In particular, we can state a result for multidimensional linear processes. 



Linear processes. Let {Ci)i>z be a sequence of i.i.d. random variables in with ||Co||oo < 
00. Let (ai)j>o be a sequence of endomorphisms of such that ||ai||oo < (for a 6* < 1). 
Define the linear process = X^j>o ^^^(Cfc-i) aiid assume that its distribution function F 
satisfies condition dH). Then we can show that the process {Xk)k>o satisfies condition (fT4]l 
for the space of bounded Lipschitz functions on M'^. We deduce conditions (jl]) and ([5]) and 
Theorem [3] leads to the following corollary. 

Corollary 1. If the distribution function of Xq verifies condition |^ with r = 1, then the 
multivariate empirical process associated to (Xfc)fc>o converges in distribution to an almost 
surely continuous Gaussian process. 

This result was already proved by Dedecker (2010), using a different technique. 
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Random iterative Lipschitz models. Here, let us focus on application concerning ran- 
dom iterative Lipschitz models. This example has been investigated before by Wu and Shao 
(2004) or Dedecker and Prieur (2007), with different techniques and under different conditions. 
Here we want to show that our technique also applies in this situation. 

Let g : M'^ X M — > be a measurable function and let (l^)n>i be an M-valued i.i.d. 
process. Let Xq be an M*^- valued random variable independent of {Yn)n>i- Define the Markov 
chain (X„)„gN by 

Xn = g{Xn~l,Yn), n > 1. 

Assume that for all y € M, g{.,y) is Lipschitz. Define the Lipschitz constant 

T^f N \9{x,y) -g{x',y)\ 
K{y) := sup , _ 

x,x'£E,Xy^x' 1-^ X I 

and suppose that there exists 70 > 1 such that 

^[(1 + K{Y^) + |5(0,yi)|)^°+i(l + K{Y))] < 00 

and 

E[K{Yi)maic{K{Yi),l}^^'>] < 1. 
Let 7 G (0, 7o] and consider the Banach space of functions from to M satisfying 

m^[f) = sup I 1 . . ^^ < 00 

where p(x) = The associated norm is ||.||^ = N^{.) + my{.) with N^{f) = sup^ • 

According to Hennion and Herve (2004) Theorem 5.5, the Markov chain has an invariant 

probability measure v such that z^(|.|'''"+-'^) < 00 and for all 7 € (0,70], the chain is B-y 

1 1 
geometrically ergodic. Further, if / G i^(|/lo)^o < Nj{f)u{p'~'°^'~''^^>)''o . Then for 71 = 

B-y-^ is continuously included in L'^o(z/). By Lemma [3l {Xn)n>o satisfies a multiple mixing 

property on B^-^^, for r = ^^^^j- Note that if / is a bounded Lipschitz function on then 

/ E B-y-^ and WfH-yi < ||/|| (where ||/|| = sup|/| + mo(/)). Thus we have the multiple mixing 

property for the space of bounded Lipschitz functions. As a corollary of Theorem [3] we get: 

Corollary 2. Assume that the distribution function of v satisfies ^ for r = ^J^-^ . // 
the distribution of Xq is v, then the empirical process associated to (A'„)„>o converges in 
distribution to an almost surely continuous Gaussian process. 



5 Ergodic torus automorphisms 



Let T be an ergodic automorphism of the torus of dimension d and ^ the Lebesgue measure 
on T"^. Then we have the following multiple mixing property. 

Proposition 3. There exist C>0, 0<7<1, for all m,p £ N*, for all bounded a-Holder 
function ip (a £ (0, 1]) with \\(p\\cx) < ^, for all fei < ... < fcm < < /i < ... < Ip, for all n £ N, 



Gov I J|(/jor'=^ JJ(/9or'^+" 



< C||(/?||i||(/?||^<.P(/!:i, . . . ,km)Y' 



(15) 
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where P{ki, . . . ,km) = X^i^i l^iT with r the size of the biggest Jordan's block of T restricted 
to its neutral subspace. 

Almost the same proposition appears in Le Borgne and Pene (2005). The shghtly modi- 
fication here is that we keep the L^-norm appearing in the upper bound. 
Proof. 

Denote by E'^ , and E'^ the T-stable subspaces of M'^ corresponding respectively to the 
stable , the unstable and the central directions of T (where T is identified to its representative 
matrix). We have 

M.'^ = E" ® E"" ® E" 
and there exists A > 1 such that for all n > 1, 

|r"w| < A~"|v| for ah V E E\ 

|r"u| > A"|^;| for ah v € 

|r"i;| < n^wl for all v G 

where |.| denotes the maximum norm on and r is the size of the greatest Jordan block of 
T restricted to the space E'^. Further can be written as the product measure of jig, fJ'u and 
He- Set 

Bi{0, p) = {x e E"- / \x\ < p}, p > 0,i = s, u, c. 

Denote by W-Wnf (resp. "H^-norm is the stable direction (resp. unstable- 

central direction). Following ideas of the proof of Proposition III. 3 in Le Borgne (1999) (see 
also Le Borgne & Pene 2005), one can prove a result concerning the good distribution of the 
stable leaves in the torus. 

Lemma 5. (good distribution of stable leaves) There exist 9 < 1 such that for all 4> G 'H'^ , 
X G T'^ and p> 0, 

1 

Ps{T-^Bs{0,p)) 

Let ^0 be a sub-cr-algebra of the Borelian one for which the atoms are pieces of stable 
leaves and set An = T~'^Ao. Let 4> and ip be two C^'^-function with zero mean. 

Cov((/), o T") = Cov(0-E((/)|^_L|j),'0or") + Cov(E((/)|^_L|j),V'or") 
< UhU - E(,/.|^_Ltj)lloo + ||</'||i||E(^|^rtl)ll-- 
But, since the diameter of the atoms of A-n decreases exponentially fast, 

U-Ki<p\Ak)\\oo<C\mnfX-'^ 

and, by Lemma [Sj 

m^|;\A^n^)\\^<C\mnzJ^. 
Thus, for 7 = max{A, 0} 2 < 1, we get 

icov(</<, V o r")i < cmiMn? + wmmnO^''- (i^) 



T-"Bs{0,p) 



(/.(r-"x + s)ds 



< 
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Further, for all n > and for all G we have 

||0or-||^. <c||,/)||^.. (17) 

Indeed, if x G T"' and v ^ E^, hy linearity of the map T, 

|0or"(x) -</)or"(x + t;)| = |0or"(x)-</)(r"(x)+T"(t;))| 



In the same way, we get 

UoT-^nz,.<Cn^nns,.. (18) 
Now, to prove the proposition, we apply what precedes to 

m p 

<P = Y[^oT''^ and V = n ° 
with negative ki and positive k. Using PT|) and (fTSj) . the computation shows that 



and 



r 

'J I 



Then, by p^ . the proposition is proved. □ 
Further the classical central limit theorem holds for any Holder functions of the torus 
(see Leonov 1960, Le Borgne 1999)(See Leonov (1960); Le Borgne (1999)). Then Theorem[3] 
applies and we get Theorem [2l 
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